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IN THE CLAIMS 

1 . (Original) A computer-based method of performing document retrieval in accordance with 
an information network, the method comprising the steps of: 

retrieving one or more documents from the information network that satisfy a user-defined 
predicate; 

collecting statistical information about the one or more retrieved documents as the one or 
more retrieved documents are analyzed; and 

using the collected statistical information to automatically determine fiirther document 
retrieval operations. 

2. (Original) The method of claim 1, wherein the user-defined predicate specifies content 
associated with a document. 

3. (Original) The method of claim 1 , wherein the statistical information collection step uses 
content of the one or more retrieved documents. 

4. (Original) The method of claim 1, wherein the statistical information collection step 
considers whether the user-defined predicate has been satisfied by the one or more retrieved 
documents. 

5 . (Currently Amended) The method of claim 1 , wherein the collected statistical information 
is used to direct further document retrieval operations toward documents which are more likely to 
satisfy the predicate than would otherwise occur with respect to document retrieval operations that 
are not directed using the collected statistical information . 

6. (Original) The method of claim 1 , wherein the collected statistical information is used to 
direct further document retrieval operations toward documents which are similar to the one or more 
retrieved documents that also satisfy the predicate. 
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7. (Original) The method of claim 1, wherein the collected statistical information is used to 
direct further document retrieval operations toward documents which are linked to by other 
documents which also satisfy the predicate. 

8. (Original) The method of claim 1 , wherein the information network is the world wide web 
and a document is a web page. 

9. (Original) The method of claim 8, wherein the statistical information collection step uses 
one or more uniform resource locator tokens in the one or more retrieved web pages. 

10. (Original) Apparatus for performing document retrieval in accordance with an 
information network, the apparatus comprising: 

at least one processor operative to: (i) retrieve one or more documents from the information 
network that satisfy a user-defmed predicate; (ii) collect statistical information about the one or more 
retrieved documents as the one or more retrieved documents are analyzed; and (iii) use the collected 
statistical information to automatically determine further document retrieval operations. 

11. (Original) The apparatus of claim 10, wherein the user-defined predicate specifies 
content associated with a document. 

12. (Original) The apparatus of claim 10, wherein the statistical information collection 
operation uses content of the one or more retrieved documents. 

13. (Original) The apparatus of claim 10, wherein the statistical information collection 
operation considers whether the user-defined predicate has been satisfied by the one or more 
retrieved documents. 
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14. (Currently Amended) The apparatus of claim 10, wherein the collected statistical 
information is used to direct further document retrieval operations toward documents which are more 
likely to satisfy the predicate than would otherwise occur with respect to document retrieval 
operations that are not directed using the collected statistical information . 

1 5 . (Original) The apparatus of claim 1 0, wherein the collected statistical information is used 
to direct further document retrieval operations toward documents which are similar to the one or 
more retrieved documents that also satisfy the predicate. 

1 6. (Original) The apparatus of claim 1 0, wherein the collected statistical information is used 
to direct further document retrieval operations toward documents which are linked to by other 
documents which also satisfy the predicate. 

17. (Original) The apparatus of claim 10, wherein the information network is the world wide 
web and a document is a web page. 

18. (Original) The apparatus of claim 17, wherein the statistical information collection 
operation uses one or more uniform resource locator tokens in the one or more retrieved web pages. 

19. (Original) An article of manufacture for performing document retrieval in accordance 
with an information network, comprising a machine readable medium containing one or more 
programs which when executed implement the steps of: 

retrieving one or more documents from the information network that satisfy a user-defined 
predicate; 

collecting statistical information about the one or more retrieved documents as the one or 
more retrieved documents are analyzed; and 

using the collected statistical information to automatically determine further document 
retrieval operations. 
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20. (Original) The article of claim 19, wherein the user-defined predicate specifies content 
associated with a document. 

2 1 . (Original) The article of claim 1 9, wherein the statistical information collection step uses 
content of the one or more retrieved documents. 

22. (Original) The article of claim 19, wherein the statistical information collection step 
considers whether the user-defined predicate has been satisfied by the one or more retrieved 
documents. 

23. (Currently Amended) The article of claim 19, wherein the collected statistical 
information is used to direct further document retrieval operations toward documents which are more 
likely to satisfy the predicate than would otherwise occur with respect to document retrieval 
operations that are not directed using the collected statistical information . 

24. (Original) The article of claim 19, wherein the collected statistical information is used 
to direct further document retrieval operations toward documents which are similar to the one or 
more retrieved documents that also satisfy the predicate. 

25. (Original) The article of claim 19, wherein the collected statistical information is used 
to direct further document retrieval operations toward documents which are linked to by other 
documents which also satisfy the predicate. 

26. (Original) The article of claim 19, wherein the information network is the world wide 
web and a document is a web page. 

27. (Original) The article of claim 26, wherein the statistical information collection step uses 
one or more uniform resource locator tokens in the one or more retrieved web pages. 
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IN THE ABSTRACT 

Please amend the Abstract as follows: 

Methods and apparatus for performing intelligent crawling are provided. Particularly, the 
int e lligent crawling techniques of the invention provide a crawler mechanism which is capable of 
learning as it crawls in order to focus the search for documents on the information network being 
explored, e.g., world wide web. This The crawler mechanism stores information about the crawled 
documents as it retrieves the documents, and then uses the information to further focus its search 
appropriately. Th e inv e n t ive t ccliniqu c s r esul t in the crawling of a small percen t age of the 
documen t s on the world wid e web. 



3 



